Goto

Collaborating Authors

 synthesizable molecule


A Model to Search for Synthesizable Molecules

Neural Information Processing Systems

Deep generative models are able to suggest new organic molecules by generating strings, trees, and graphs representing their structure. While such models allow one to generate molecules with desirable properties, they give no guarantees that the molecules can actually be synthesized in practice. We propose a new molecule generation model, mirroring a more realistic real-world process, where (a) reactants are selected, and (b) combined to form more complex molecules. More specifically, our generative model proposes a bag of initial reactants (selected from a pool of commercially-available molecules) and uses a reaction model to predict how they react together to generate new molecules. We first show that the model can generate diverse, valid and unique molecules due to the useful inductive biases of modeling reactions. Furthermore, our model allows chemists to interrogate not only the properties of the generated molecules but also the feasibility of the synthesis routes. We conclude by using our model to solve retrosynthesis problems, predicting a set of reactants that can produce a target product.


SynCoGen: Synthesizable 3D Molecule Generation via Joint Reaction and Coordinate Modeling

Rekesh, Andrei, Cretu, Miruna, Shevchuk, Dmytro, Somnath, Vignesh Ram, Liò, Pietro, Batey, Robert A., Tyers, Mike, Koziarski, Michał, Liu, Cheng-Hao

arXiv.org Artificial Intelligence

Ensuring synthesizability in generative small molecule design remains a major challenge. While recent developments in synthesizable molecule generation have demonstrated promising results, these efforts have been largely confined to 2D molecular graph representations, limiting the ability to perform geometry-based conditional generation. In this work, we present SynCoGen (Synthesizable Co-Generation), a single framework that combines simultaneous masked graph diffusion and flow matching for synthesizable 3D molecule generation. SynCoGen samples from the joint distribution of molecular building blocks, chemical reactions, and atomic coordinates. To train the model, we curated SynSpace, a dataset containing over 600K synthesis-aware building block graphs and 3.3M conformers. SynCoGen achieves state-of-the-art performance in unconditional small molecule graph and conformer generation, and the model delivers competitive performance in zero-shot molecular linker design for protein ligand generation in drug discovery. Overall, this multimodal formulation represents a foundation for future applications enabled by non-autoregressive molecular generation, including analog expansion, lead optimization, and direct structure conditioning.


A Model to Search for Synthesizable Molecules

Neural Information Processing Systems

Deep generative models are able to suggest new organic molecules by generating strings, trees, and graphs representing their structure. While such models allow one to generate molecules with desirable properties, they give no guarantees that the molecules can actually be synthesized in practice. We propose a new molecule generation model, mirroring a more realistic real-world process, where (a) reactants are selected, and (b) combined to form more complex molecules. More specifically, our generative model proposes a bag of initial reactants (selected from a pool of commercially-available molecules) and uses a reaction model to predict how they react together to generate new molecules. We first show that the model can generate diverse, valid and unique molecules due to the useful inductive biases of modeling reactions.


Reviews: A Model to Search for Synthesizable Molecules

Neural Information Processing Systems

Originality: The proposed model (Molecule-Chef) is a novel combination of existing deep learning models - like Graph Neural Networks, RNNs, etc. to solve the task of molecule searching and to provide a synthesis recipe for the same. The authors also ensure the validity of products by restricting the latent space to chemical reactants that are readily available to chemists. Quality: The authors compare Molecule-Chef with state-of-the-art baselines and report improved/comparable results. By restricting the latent space, the model produces more valid molecular products as compared to other models. The paper also presents results of retrosynthesis, in which given some molecular products, the decoder of Molecule-Chef can be used to generate the possible combinations of reactants that were used to create the product.


Reviews: A Model to Search for Synthesizable Molecules

Neural Information Processing Systems

The authors propose a deep learning method to generate molecular structures that are synthesizable and give a recipe for their synthesis from reactant molecules. The reviewers fond the contribution be significant for practioners.


A Model to Search for Synthesizable Molecules

Neural Information Processing Systems

Deep generative models are able to suggest new organic molecules by generating strings, trees, and graphs representing their structure. While such models allow one to generate molecules with desirable properties, they give no guarantees that the molecules can actually be synthesized in practice. We propose a new molecule generation model, mirroring a more realistic real-world process, where (a) reactants are selected, and (b) combined to form more complex molecules. More specifically, our generative model proposes a bag of initial reactants (selected from a pool of commercially-available molecules) and uses a reaction model to predict how they react together to generate new molecules. We first show that the model can generate diverse, valid and unique molecules due to the useful inductive biases of modeling reactions.


A Smarter Way To Develop New Drugs Using Artificial Intelligence

#artificialintelligence

MIT scientists have developed a machine learning model that proposes new molecules for the drug discovery process, while ensuring the molecules it suggests can actually be synthesized in a laboratory. A new artificial intelligence technique has been developed that only proposes candidate molecules that can actually be produced in a lab. Pharmaceutical companies are using artificial intelligence to streamline the process of discovering new medicines. Machine-learning models can propose new molecules that have specific properties which could fight certain diseases, accomplishing in minutes what might take humans months to achieve manually. But there's a major hurdle that holds these systems back: The models frequently suggest new molecular structures that are difficult or impossible to produce in a laboratory.


A smarter way to develop new drugs

#artificialintelligence

Pharmaceutical companies are using artificial intelligence to streamline the process of discovering new medicines. Machine-learning models can propose new molecules that have specific properties which could fight certain diseases, doing in minutes what might take humans months to achieve manually. But there's a major hurdle that holds these systems back: The models often suggest new molecular structures that are difficult or impossible to produce in a laboratory. If a chemist can't actually make the molecule, its disease-fighting properties can't be tested. A new approach from MIT researchers constrains a machine-learning model so it only suggests molecular structures that can be synthesized.


A Model to Search for Synthesizable Molecules

Bradshaw, John, Paige, Brooks, Kusner, Matt J., Segler, Marwin, Hernández-Lobato, José Miguel

Neural Information Processing Systems

Deep generative models are able to suggest new organic molecules by generating strings, trees, and graphs representing their structure. While such models allow one to generate molecules with desirable properties, they give no guarantees that the molecules can actually be synthesized in practice. We propose a new molecule generation model, mirroring a more realistic real-world process, where (a) reactants are selected, and (b) combined to form more complex molecules. More specifically, our generative model proposes a bag of initial reactants (selected from a pool of commercially-available molecules) and uses a reaction model to predict how they react together to generate new molecules. We first show that the model can generate diverse, valid and unique molecules due to the useful inductive biases of modeling reactions.


The Synthesizability of Molecules Proposed by Generative Models

Gao, Wenhao, Coley, Connor W.

arXiv.org Machine Learning

The discovery of functional molecules is an expensive and time-consuming process, exemplified by the rising costs of small molecule therapeutic discovery. One class of techniques of growing interest for early-stage drug discovery is de novo molecular generation and optimization, catalyzed by the development of new deep learning approaches. These techniques can suggest novel molecular structures intended to maximize a multi-objective function, e.g., suitability as a therapeutic against a particular target, without relying on brute-force exploration of a chemical space. However, the utility of these approaches is stymied by ignorance of synthesizability. To highlight the severity of this issue, we use a data-driven computer-aided synthesis planning program to quantify how often molecules proposed by state-of-the-art generative models cannot be readily synthesized. Our analysis demonstrates that there are several tasks for which these models generate unrealistic molecular structures despite performing well on popular quantitative benchmarks. Synthetic complexity heuristics can successfully bias generation toward synthetically-tractable chemical space, although doing so necessarily detracts from the primary objective. This analysis suggests that to improve the utility of these models in real discovery workflows, new algorithm development is warranted.